NCU IISR System for NTCIR-12 MobileClick2

نویسندگان

  • Wen-Bin Han
  • Hung-Hsiang Wang
  • Richard Tzong-Han Tsai
چکیده

This paper describes our approach to the NTCIR-12 MobileClick task. First of all, we do some extra process on the baseline. Next, we try to use a totally different method from baseline which is machine learning. Finally, tune the two types into better situation and apply them to test data. Our system achieves an nDCG@3 score of 0.7415, nDCG@5 score of 0.764, nDCG@10 score of 0.8059, nDCG@20 score of 0.8732 and a Qmeasure score of 0.9004, outperforming the baseline a little bit.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

NCU IISR System for NTCIR-11 MedNLP-2 Task

This paper describes NCU IISR’s Japanese ICD-10 Code Linking system for NTCIR-11 MedNLP. Our system uses Conditional Random Fields (CRFs) to label ICD-10 mentions and temporal expressions. We also use CRFs to detect the modalities of the ICD-10 mentions. To resolve the problem of ICD-10 mention normalization, we use the Lucene engine to link mentions to the corresponding ICD-10 database entries...

متن کامل

CUIS at the NTCIR-12 MobileClick2 Task

We present our approach for tackling the iUnit ranking and iUnit summarization subtasks of MobileClick2. We first conduct intent discovery based on latent topic modeling. Our iUnit ranking method exploits the discovered intents and considers the importance of an iUnit in each Web content document. We further develop our iUnit summarization model using the outcome from the iUnit ranking subtask....

متن کامل

NUTKS at NTCIR-12 MobileClick2: iUnit Ranking Subtask Using Topic Model

In this paper, NUTKS (Nagaoka University of Technology, Knowledge Systems Laboratory) reports the results of our participation at the NTCIR-12 MobileClick task, iUnit Ranking subtask. The authors have ranked iUnit by using similarity of iUnit word distribution based on LDA topics. Our system has recorded Q-measure score as 0.7392 and nDCG@20 score as 0.6334. In baseline system, the recorded Qme...

متن کامل

Improving iUnit Retrieval with Query Classification and Multi-Aspect iUnit Scoring: The IISR System at NTCIR-11 MobileClick Task

This paper describes our approach to the NTCIR-11 MobileClick task. Based on the assumption that different user intentions should be handled by different extraction/retrieval strategies, we first classify each query into one of our eight defined query types and set the weights of the extraction methods accordingly. Next, we extract the relevant parts of the search results and rank the extracted...

متن کامل

Enhance Japanese Opinionated Sentence Identification Using Linguistic Features: Experiences of the IISR Group at NTCIR-8 MOAT Task

Statistics and Date The non-opinionated sentences tend to have numbers and these numbers may be statistics, period of time, or a specific time.  Sentence containing a statistics data バリ島駐在の日本人向け旅行社によると、バリを訪れる年間約30万人 の日本人観光客のうち6割以上が女性客。  Sentence containing a period of time 日本でも在日米軍が95年12月から96年1月に沖ノ鳥島の演習で 劣化ウラン弾1520発を誤射したことが判明。  Sentence containing a specific time 昨年10月12日、インドネシア・バリ島のディスコで爆弾テ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016